PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Gh_D01G1935
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 442 aa    MW: 50723.7 Da    pI: 6.7409
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source
Gh_D01G1935      genome    NAU-NBI
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  76.2   5.1e-24  117    213  1          85
     trihelix   1 rWtkqevlaLiearremeerlrrgk...........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfdql 85 
                  +Wt+++v++Li+a+++++e++  ++           + k++W++vsk+++erg+++sp+qC++k+++lnkrykk+ ++ +++ +++++++ +++d +
  Gh_D01G1935 117 KWTDKMVRLLITAVSYIGEDMAGDCgggirrqfavlQTKGKWKSVSKVIAERGYHVSPQQCEDKFNDLNKRYKKLYDMLGRGiSCQVVENPALLDVI 213
                  7*********************99888899999999*********************************************9999999999888766 PP

Protein Features
3D Structure
Database  Entry ID  E-value  Start  End  InterPro ID  Description
Pfam      PF13837   9.5E-21  115    240  No hit       No description
Sequence
Protein Sequence    Length: 442 aa
MEGNLSRGII PGGSSFGGLD LQGSMMVHHR AQNPHNMHHH HHHPNPRRGT SAHPGIPLTA  60
GTMQNSDQPV TMIDYNKMEI GKCSVSDEDE PSFAEEGVDG HNDGNKGKKG SPWQRVKWTD  120
KMVRLLITAV SYIGEDMAGD CGGGIRRQFA VLQTKGKWKS VSKVIAERGY HVSPQQCEDK  180
FNDLNKRYKK LYDMLGRGIS CQVVENPALL DVIDYLTEKE KDDVRKILSS KHLFYEEMCS  240
YHNGNRLHLP HDLKLQRSLQ LALRRRDENE NDDVRRHQRD DLDDDDHDME TDDHDELEEN  300
HASHGDNRAI FGAPGGSTKR SRQSQVHEDA CFQKFLNSQD CNKSSFSSPP VAQADTNQVL  360
PDYSRAAWLQ KQWTESRSLQ LEEQKLQIQV DMLELEKQRF KWQRFSKKSD CELEKIRMGN  420
ERMKLENERM ALELKRKELA AD
Expression -- UniGene
UniGene ID  E-value  Expressed in
Ghi.11056   0.0      boll
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_012468067.1      0.0      PREDICTED: uncharacterized protein LOC105786254
TrEMBL  A0A0D2QHP8          0.0      A0A0D2QHP8_G
STRING  POPTR_0002s06930.1  0.0      (Populus trichocarpa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM50422752
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G21200.1  1e-152   sequence-specific DNA binding transcription factors